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In this article, we sliow tiiat a linear combination X of n independent, unbiased Bernoulli random variables 
{Xj.} can match the first 2n moments of a random variable Y which is uniform on an interval. More generally, 
for each p>2, each Xj. can be uniform on an arithmetic progression of length p. All values of X lie in the range 
of y, and their ordering as real numbers coincides with dictionary order on the vector {X-i ,.. . ,X„). 

The construction involves the roots of truncated (^-exponential series. It applies to a construction in numerical 
cubature using error-correcting codes For example, when n = 2 and p = 2, the values of X are the 4-point 
Chebyshev quadrature formula. 



1. INTRODUCTION 

One of the standard proofs of the central Hmit theorem es- 
tabHshes that the moments of the normaHzed sum 

Xi+X2 + ...+X„ 
\fn 

of n i.i.d. centered random variables with finite moments con- 
verge to the moments of a Gaussian random variable. This fact 
raises the question of when the first n moments of a random 
variable Y can be matched by a linear combination of indepen- 
dent copies of another variable X. It is an easy exercise with 
cumulants that this is impossible when Y is Gaussian and X is 
not Gaussian. In this article we will show that for every n, Y 
can be the uniform distribution on an interval if X is unbiased 
Bernoulli. More generally X can be uniform on an arithmetic 
progression of length p for any p > 2. 

We conjecture that the moments of most absolutely con- 
tinuous distributions cannot be matched by those of a linear 
combination of Bernoulli random variables. In this sense the 
uniform distribution on an interval has "special moments". 

Theorem 1. Let p > 2 be an integer, let X be a uniformly 
random variable on the set 

{p-l,p-l>,p-5,...,\-p}, 

and let Xi,X2, ■■■ ,X„ be independent copies ofX. Then there 
exist unique constants 

fll > (32 > • • • > flf! > 

such that the first 2n moments of 

X = aiX\ + fl2^2 + 1" l^nXn 

agree with the first 2n moments of a random variable Y which 
is uniform on [—1, 1]. Moreover 
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Figure I: The values of X marked on a ruler when p = 2 and n = 3. 



The second part of Theorem[2implies that all p" values of 
X lie in the interior of the interval [—1,1]. Moreover, if we 
cut [—1,1] into p" equal subintervals, then each subinterval 
contains exactly one value of X. This pattern is summarized 
by the relation 

xe{txjp-'-p-'\txjp-^+p-") c (-1,1), 

7=1 j=l 

which is easily equivalent to the inequality in Theorem^ 

A more combinatorial formulation is as follows: If we num- 
ber equal subintervals of [— 1 , 1] from to — 1 and if a par- 
ticular value of X lies in subinterval k, then the jth digit of k 
in base p is determined by the jth variable Xj. In symbols, 

Figure[Oshows an example. If p — 2 and n — 3, then 

X « ±.500128 ± .243941 ± .153942, 

where the signs are independent and random. It is natural to 
think of [—1,1] with its p" subintervals as a ;?-adic ruler. In 
Figure[2 the 8 values of X with p~2 and n — 3 are marked on 
a dyadic ruler. The figure bears out Theorem[2because each 
mark lies in a different gap between ruler marks. The marks 
appear to be at a fixed distance from the ruler marks, but this 
is only approximately true: The first two terms of X are close 
to ±5 and but not exactly equal. In the general case, 
Lemma|5]below establishes that the coefficient aj is extremely 
close to the (halved) ruler spacing p^j when j is small: 

ajpj ^l±Oip-'''-''), (1) 
where k = n + I — j. 
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Let Zp n be the range of X. Since X has the same first 2n 
moments as Y, indeed trivially the same (2n + l)st moment as 
well, the equation 

^■'-^ P xeZp,„ 

holds for any polynomial P of degree at most 2n + 1 . A 
weighted set Z with this property up to some degree t is 
called an (interpolatory) t- quadrature formula. E.g., Simp- 
son's rule and Gaussian quadrature are standard quadrature 
formulas. Our quadrature formula Zp „ is highly inefficient 
for general p and n, but its special structure is useful for the 
higher-dimensional cubature problem for integration on the k- 
cube [— 1 , 1]*^. Elsewhere 1 3] we combine the product formula 
•^2 n ^^^^ binary error-correcting codes, in particular extended 
BCH codes, to obtain a (2n + l)-cubature formula on [— l,!]*-' 
with equal weights and 0{k") points. (The asymptotic bound 
is with n fixed and k °o.) 



This framework is designed so that first, cumulants carry the 
same information as moments, and second, cumulants are ad- 
ditive, i.e., 

Kx{t)+KY{t)^Kx+Y{t), 

for independent random variables X and Y. 

Equation (|3 yields the cumulant generating function equa- 
tion 

which we can write as a relation between individual cumu- 
lants: 

KkiX) + KkiY)^p''Kk{Y). 

The odd cumulants of X and Y vanish since they are even 
random variables. The even cumulants thus satisfy the relation 

K2kiX)^ip''~l)K2kiY). 



2. THE PROOF 



We will write a„ ^ for aj, to make clear that they depend on 



Since 



K2k{X) = '£al'jK2kiX), 



Lemma 2. The random variables X and Y have the same first 
2n moments if and only if 



I "n./ 
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for all I < k < n. 



Proof. Recall that Y is uniformly random on [—1,1] and that 
X is uniformly random on 

{p- l,/'-3,/?-5,...,l -p}. 

Thus if they are independent, then X + Y is uniformly random 

on [—p,p], so 



X + Y = pY. 



(2) 



Here "=" means equality of distribution. 

To understand the implications of this relation between X 
and Y, we review the relations among cumulants, moments, 
and their generating functions 1 1 ] . For a general random vari- 
able X, the Ml moment is denoted fikiX), the exponential gen- 
erating function of all of the moments is the moment func- 
tion Mx{t), the cumulant function Kx{t) is its logarithm, and 
the cumulants Kk{X) are defined by Kx (?) as their exponential 
generating function. In formulas. 



^kiX)'^ E[X'] Mxit)"^E[e^'] = j: 



k=0 



k\ 



Kx{t) = \nMx{t) 



i'^'^Kxit) 



k=0 



k\ 



it suffices for the fl„,/s to satisfy the stated power sum relation. 

This condition is also necessary provided that each 
K2k{Y) ^ 0. To check this, we will estabHsh that K2k{X) ^ 
when p — 2. The moment function with imaginary argument, 
Mx{it), is also called the characteristic function of X (mean- 
ing the Fourier transform of the distribution oiX). In this case, 
its logarithm is: 

Kx{it) = logcosf Kx{it)' = -tanf. 

The relation 

(tanf)' = (tanf)^ + I 

implies that the tangent function has strictly positive odd 
derivatives, so 

{-\f+'K2k{X)>0 

for all k. □ 
For convenience let q — p^, and let: 

bn.j = Cl„i I'll.] = T • 

Lemma|2]can then be restated as 

" 1 

for all 1 < < n. This implies a unique solution for the a„,/s 
provided that each b„ j is real and positive. For convenience 
we will study a polynomial whose roots are r„ j for I < j <n. 
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Lemma 3. If 

n 1 

for all I < k < n, then 
Proo/ Let 

n 

Pk = E ^n.j 

7=1 

be the A:th power sum of the b„ j's, and let be the corre- 
sponding elementary symmetric function, so that 

n n 

fj(l+fe„,/x) = l + ^eix'=. 

7=1 k=\ 

Since the first n elementary symmetric functions determine 
the first n power sums and vice versa, and since our desired 
value 

1 

Pk = ~k 7 

does not depend on n, we can derive each ek by taking the 
limit n — > oo and finding Zjoo./s to match all pk^- Let 

Then 

Pk = E = ^r~T 

since the left side is a geometric series. Moreover 

£ b„.j^b„.j,---b„,j^ 

l<Jl<j2<-<Jk 

1 

^ (^-l)(^2_i)...(^A._l) 

by a routine combinatorial exercise. Another way to recog- 
nize these values of pi^ and is that they are the principal 
specialization of the ring A of symmetric functions ^4, §7.8], 
transported by the fundamental involution (O and a sign invo- 
lution o: 

To conclude, our explicit choice for the booj's establishes that 
the given p^'s are consistent with the claimed e^'s. □ 

To continue the example mentioned first in Section^ 

2 3 

^4,<3W = 1-- + --^. 



Its roots are 

{-^,-^,-^) w (3.997956,16.80465,42.19739), 

^3 1 ^3 ^ ^3 3 

so that 

(03,1,03,2,03,3) « (.500128, .243941, .153942). 
Therefore 

X w ±.500128 ± .243941 ± . 153942 

when p — 2 and n = 3. 

We will need the g'-Pochhammer symbol 01 : 

{a-q)k^luj=:il-aq-^)-' k<0. 

[l k = 

We also define 

Fc,.„ix) = j^—- F^{x) = j^— 

n ^k 00 ^ 
Fq.<n{x) ^ E 7 r F^->n{x) = ^ 7 T" ^^'> 

^o(i,i)k t^+i{qA)k 

(The polynomial F^ <„(jic) was already used in Lemma|3]) We 
will often use the relation 

Fq.n-i{x) 1-q"' 

The function Fg{x) is related to the standard Jackson q- 
exponential function eg{x) |2] by 

1 — ^ 

(In some works, Fa{x) itself is called a <7-exponential.) By the 
proof of Lemma|3l 

Fq{x) = Yl(^-q-^x) - (5) 

This identity holds for all q as an equality of formal power 
series. We will need the stronger fact that it is an equality of 
entire analytic functions when q > I. 

To establish Theorem ^ we would like to understand the 
effect of truncation on the first n zeroes of Fq{x) when q>4. 

Lemma 4. If q> 4, then Fq_<„{x) has n distinct, positive 
roots. 

Proof. It suffices to show that the value 

fk,<n = F,^<„iq''+^) 

alternates in sign as k ranges from to n. The terms of <„ 
are 

qjik+^) 
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These alternate in sign in j and we claim that /jt,<„ has the 
same sign as /t j.. This claim will imply the lemma. 

By equation (|4}, the sequence {l/ij i} is unimodal in j and 
achieves its maximum at j = k. This already implies that fk,<n 
has the same sign as /^j. if ^ = or ^ = n. If 1 <k <n, then 



k-\ 

I ^ fk.i\ < \fk,k-l\ 

Finally 

|/ir,A:-l| + I/m+iI q 



I 51 fl'-A < \fk.k+l\ 
j=k+l 



\fk,k 



q 



1' 



1+1 . 



-T<-<1- 



The first equality once again comes from equation @, while 
the last inequality is the only step that requires q >4 instead 
of merely q > I. Thus in each case /t^ is the dominant term 
inA,<„. □ 

We continue our example case with q = 4- and n = 3 to il- 
lustrate Lemmal^and its proof: 



^4,<3(2) 
^'4,<3(8) 
^'4,<3(32) 
^4,<3(128) 



2 


4 8 1189 




3 + 
8 


45 2835 2835 
64 512 1205 




3 + 
32 


45 2835 2835 
1024 32768 4339 




y " 

128 


^ 45 2835 " 2835 
16384 2097152 


1183085 




45 2835 


2835 



In this example we can see that the ^h sum is dominated by 
its kth term. 

Since f4,<3(x) is a cubic polynomial with four values that 
alternate in sign, it therefore has distinct, positive roots, pre- 
viously noted to be 

('•3,1, '■3,2, '-33) « (3.997956,16.80465,42.19739). 

Lemma^is also implied by the final lemma, Lemma|5] but 
the proof of Lemma|5lis much more complicated. It may un- 
fortunately be as taxing for the reader as it was for the author 



Lemma 5. Let q>4. Let r„ j 



be the roots ofFq <„{x) and let k = n + l — j. Then r„ jq ■' lies 
between 1 and 



Ck 



'\-2q-^ 

l + (^l)H^-(*2') 



k^\ 

k>\ 



(6) 



for every I < j <n. 
Proof. Let 

with m independent of «, and extend this notation to all of the 
definitions in Q- 



The proof consists of three steps. In the first step, we will 
show that the lemma follows from the inequality 



Fq[Ckq^) >Fc.>n{ckq') 



(7) 



and we will show that both sides are positive. In the second 
step, we will show that this inequality follows from the in- 
equality 



V=lim ^'^"'''^ 



> 1, 



Fc,.>„{ckqj) 

where k is fixed in taking the limit. Finally in the third step, 
we will show that 

Step L We claim that equation implies that 

FqSn{x) = F^ix) - Fif^>n{x) (8) 

changes sign as x passes from q^ to Cj^qL In the estimates for 
this step we will assume that 



q^ <x < c/^q^ or cj^q-' < x < q"* , 

except when we explicitly state more general conditions. 
By equation (|5j, F^iq^) vanishes when j > 1- When x > 0, 

Fg.„+l{x) > 

by the definition of /^y.„+i (x). Moreover 

Fg,>„{x)>0 (9) 

because, by equation 0, the series for F^.>„(x) is alternating 
and decreasing when < x < — 1 . Thus 

Fg,>n{q')>o 

since j < n and the argument x = q' is thus in the required 
range. Equation (|8j now tells us both that 



J <x<a^. 



willy-nilly, and that 



Fy,<n{q') < 



Fq.<n{.Ckq-') > 



is equivalent to equation Q. This establishes the first claim 
of this step. 

Equation also tells us that 

Fg.>„{ckq') > 0, 

since x — Cj^q-' is also in the required range < x < q"^^ — 1 . 
Finally we confirm that 

F,{ckq')>Q 
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by counting sign changes in the factors of equation To do 
this properly, observe from equation that q^^ ' < c^q^ < q-' 
when k is odd and qj < c^q^ < q^^^ when k is even. Either 
way, the number of sign changes has the same parity as j+k = 
n + 1. 

Step 2. The goal of this messy step is to reduce equation 
to its asymptotic limit n,j °o with k fixed. We will use 
some preliminary relations for the Pochhammer symbol. The 
product relation 

{a;q),„ = {a;q)(Xaq^;q),„_( (10) 

holds when £ is finite (but m need not be). The inversion rela- 
tion 

H;.). = ^^ (11) 

holds for aU finite £; it follows from the trivial identity 

1 — fl = — fl(l — fl^^). 

The inequality 

{a;q)-i>l (12) 

holds when <£ <oo and <a < q (with equality only when 
£ = 0). Finally 

{a;q)(+i~{a;q)( = -aq^{a;q)( (13) 

for all finite £. We will also use the elementary binomial iden- 
tity 



(14) 



The left side of equation limits to a product of manage- 
able factors: 



Fqickq-') = 



> 



{ckqj;q)-^ 

(-l)"+'(l-c,)fag;g),_, 
{ck;q)-oo 

(-1)^+'(1-Q)g(0cf^ 
{c-^-q)i^j{ck;q)^oo 

(-1)^+'(1-Q)g(2)cf ^ 



(by eq. ([ToJ) 
(by eq. (E)) 



(by eqs. (lT()FT2t 1. 



Meanwhile, the right side of equation Q essentially stabi- 
lizes as a power series in xq^"^^: 



J 



=0 il'-'^)e+n+i 
n+i V i~xYil;q)-e-n-i 

(?=0 q\ 2 I 



A 2 ) i=0 



q 



(0 



(by eq. (E)) 

(by eq. d) 



To isolate the power series, let 

{-tf{\-q)_i,_„^l 



1=0 



q 



(0 



Then 



fll 2 J 



To complete step 2, we will show that G^_„(x) is monotonic 
in n and consolidate inequalities. Observe that 



Gq,m+l {t ) - Gci.,n{t ) — 



i-tYihq)-e-m-2 



to ^(2)+^'+'"+2 

by equation ( I13> . This series is alternating decreasing when 

< f < ^ — q^'^^^ and m > 0, whence 



(f) > G^,,„(f). 



In particular. 



^=0 0^(2) 



when < t < q — q " , by induction on m > n. In particular 

2 j 

because f = c/^q^''^^ is well within range given that ^ > 1, 
Q, < q, and q>4-. 

Thus we have estimates for both sides of equation Q. Upon 
close examination, they sacrifice less and less as n 00. Com- 
bining the estimates. 



Fq.>n {ckqj) [cl {ck\ q)-<^Gi,^oo{ckq-^- 1 ) 



(15) 



The right side is the limit /i^jt defined previously. We have 
shown that the lemma follows from the inequality hg ^ > 1 . It 
is also necessary if our use of the intermediate value theorem 
in step 1 is to work for all n. 

Step 3. We will need the Pochhammer symbol estimate 



{a;q)_l > 1 - 



q~l 



(16) 



We claim that it holds when < a < q and q > 1. By equa- 
tion o 

{a;q)z\ = (1 - aq'^){a;q)\\^ > {a;q)\\i,- aq'^ 

for any £>l, with equality only when £ = I. Thus by induc- 
tion. 



{a;q)_^^ > I — ^ aq 
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again with equality only when £ = I. Now we sum the geo- 
metric series in the limit °°. 
We will also need the estimate 



(17) 



which holds when < f < 1 because the power series for 
Gc/,oo(f) is then alternating decreasing. We apply equations 
M6\ and Mil to the right side of equation M5\ to obtain 



h,,,>{-lf+\l~c,)qi''^')c-,'-'^ 



^ q —l^ ^ q —I' ^ q—l 



when k ^2. The inequality holds when A: = 2 as well, but it 
is not adequate because our proof is a close shave in this case. 
So we will define hq 2 differently. We refine equation (I16> . 

M:i.(i-?)«„-.,):i>(i-?)(i-;^). 



to obtain 



/.,,2>(c2-lkV(l-^)(l-^ 



def- 



q q{q-K 



If we apply equation (|6j to the first occurrence of here, 
we learn that 



when k >2, while 



.-1-^ (1 _ ^) (1 _ ^) (1 _ J_) 



;„1 =2c72(l-^)(l-^)(l-^) 



,,2=4c23(l-^)(l-^)(l-i)( 
^ ^ 1^^ 1'^ q' ^ 



We claim that h^j i, > 1. It can be checked directly with 
symbohc algebra that 

r _ 2q{q^-Aq + 2){q^-2q + 2) ^ 
{q-mq-2Y 

when q>A (indeed when > 3.718), and that 

r _ V(g2-g-l)(g2-2g+2)(g2^2)(g4-2g3-4) ^ ^ 

when ^ > 4 (indeed when q > 3.974). When A; > 2, we claim 
that 



-l-k 



> 



99 

Too 



q-\ 8 



^-1-3 



1- 



c^' 5 

> 77, 



^-1 



To check the first of these inequalities, we apply equation (|6j 
and take the logarithm of both sides. We want to show that 

(i+A)iog(i+(-i)^4^-m)<iog^. 

We can assume that k is even, so that k>4. We can simplify 
using the elementary inequalities 

log(H-x)<^ log--^>x. 

1 —X 



Thus it suffices to show that 



(l+/t)4^"( 2') < 



k+l\ 1 



100 



This holds easily assuming that q >4 and k>4. The other 
three inequalities also hold easily given that q>4. 
Thus when k> 2, 



hg,k > 4 



99 5 5 2 33 



100 8 8 3 32 



> 1. 



This completes step 3 of the proof. 



□ 



We return to our continuing example with q — 4 and n ~3. 
Recall that 



F4,<3ix) = 1 



2 3 
X X X 



3 45 2835' 



and that its roots are 



('■3,i,'-3.2,'-3,3) ~ (3.997956,16.80465,42.19739). 

The roots are alternately below and above 4, 16, and 64; the 
first root is very close to 4, the last one not so close. We can 
expect this pattern because, first, 

^4(4") =0 

for any « > 1, and second, the difference between F4^<j,{x) 
and F4{x), 

F4,>3{x) = 



722925 739552275 

is very small when x is small. The series F4,>3 (x) is also domi- 
nated by its first term even whenx = 64. Since F4,>3(ji:) is pos- 
itive when X < 64, the direction in which it displaces the first 
three roots of F4{x) depends only on the sign of the deriva- 
tive F^{x). (The sign of the derivative f'{x) of any differen- 
tiable f{x) must alternate between consecutive simple roots.) 
Lemma |5] is a careful estimate of the displacement (indeed 
correct to within a universal constant factor). 

To complete the proof of Theorem [2 we return to the def- 
inition of c/i in the statement of Lemma|5]and the convention 
k = n + l- j. Recall that 



q^p 
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We claim that by Lemina|5j 



|p-^-aJ<p-^-|l--L|<_A_ 



„7+4*:-2 ■ 



(18) 



The first inequality is equivalent to Lemma |5] The second 
inequality is far from sharp (equation Q is closer to the truth), 
but it is convenient to prove Theorem ^ To establish it, we 
need the elementary inequalities 



1 



1 X 



1 X 3x2 



for X > 0. (They follow from the Taylor remainder theorem.) 
When ^ = 1 , we want to show that 



1 



:-l<2p 



-2 



This can be established by symbolic algebra for p>2 (indeed 
p > 1.799). When A: > 1 is even. 



1 



1 



v/l+4p-^-(^-+i) 
using that k>2. When > 1 is odd, 
1 



-l<2;7-*'*+i) + 12p 



-2k{k+l) 



<2p 



l-k{k+l) 



<2p 



2-Ak 



using that p>2 and ^ > 3. 

Finally the theorem follows from equation J18t by a geo- 
metric sum: 

t 1^ « I -/I f 2(p-l) 2p 

7=1 *:=! f P ^P^^ 



and 



hm(-l)'^+'(l-Q,)?C2') = (l;^)3_ 



The value of this limit has an interesting interpretation when q 
is a prime power that may or may not be related to the present 
work. Its reciprocal is the limiting probability that 3 indepen- 
dent, random nxn matrices over the field are non-singular. 

That c„jt is monotonic in n follows more directly from the 
interesting recurrence 

This recurrence also shows that c„ is near c„_i j^. This was 
the basis of the author's first attempted proof of a lemma like 
Lemma|5] Such an attempt might yet have merit. 

Finally we conjecture that Theorem^together with its geo- 
metric interpretation has a broad generalization to the mixed- 
base case: 

Conjecture 6. Let pi,p2, ■ ■ ■ ,Pn ^ 2 be a sequence of inte- 
gers. Let Xi,X2, ■ ■ ■ ,X„ be independent random variables such 
that Xj^ is uniformly distributed on the set 

{Pk-^,Pk-^,Pk-5,...,l-pk}. 

Then there are unique constants 

ai > a2 > ■ ■ ■ > a„ > 

such that the first 2n moments of 

X = a\X\ + fl2^2 + 1" fl«^n 

agree with the first 2n moments of a random variable Y which 
is uniform on [—1,1]. Moreover 



as desired. 



3. FINAL REMARKS 



Lip.i-^)Wi-Y\p'k'\<Y\pi' 

j=l k=l k=l 



The proof of Lemma|5]obtains somewhat more information 
about the roots {r„j} of Fy <,j{x) than its statement. The proof 
shows that the sequence 

_ -j _ fn,n+l-k 

Cn,k - rn.jq - „+i_j- 
H 

is monotonic in n for every fixed k. We can also change the 
bound to be the solution to the equation 



{-lY+\l^c,)qi'V)c-y-k 



{c,.^;q)^^{ck;q)-,^GcAckq ') 



in the range q '/^ < q < ^'Z^. (The equation is taken from 
equation il5\ .) Then for this new value of c^, 

lim c„ t = Ck 



Figure 2: The 6 values of X marked on a ruler when n = 2 and 

{puPl) = (2,3) or {pupi) = (3,2). 



For example, we can confirm Conjecture|6]when n = 2 and 
either {pi,p2) = (2,3) or {puP2) = (3,2). If we again let 
r; = fl7^> then in the first case, 

n = 15-2V30 r2=20 + 2\/30 
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and 
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